Automatically deriving structured knowledge bases from on-line dictionaries1

نویسندگان

  • William Dolan
  • Lucy Vanderwende
  • Stephen D. Richardson
چکیده

keywords: computational lexicography; lexical knowledge bases We describe an automated strategy which exploits on-line dictionaries to construct a richly-structured lexical knowledge base. In particular, we show how the Longman Dictionary of Contemporary English (LDOCE) can be used to build a directed graph which captures semantic associations between words. The result is a huge and highly interconnected network of words linked by arcs labeled with semantic relations such as Hypernym, Part_of, Location, and Purpose. We argue that this knowledge base provides much more detailed information about word meanings than can be obtained using standard lexical lookup procedures or by relying on statistical measures of semantic associations among words. 1We would like thank the other members of the Microsoft Natural Language group: Joseph Pentheroudakis, Karen Jensen, George Heidorn, and Diana Peterson.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving Clinical Diagnosis Inference through Integration of Structured and Unstructured Knowledge

This paper presents a novel approach to the task of automatically inferring the most probable diagnosis from a given clinical narrative. Structured Knowledge Bases (KBs) can be useful for such complex tasks but not sufficient. Hence, we leverage a vast amount of unstructured free text to integrate with structured KBs. The key innovative ideas include building a concept graph from both structure...

متن کامل

A DL Semantics for Reasoning over OVM-based Variability Models

Software Product Line (SPL) development has traditionally included Variability Management as a way of defining, modelling, implementing and testing variability. In this context, we have created a framework, SeVaTax, based on extensions of the Orthogonal Variability Model (OVM), and aimed at analysing properties of variability models and deriving products from an SPL. Despite several approaches ...

متن کامل

Learning to Extract Relations from MEDLINE

Information in text form remains a greatly underutilized resource in biomedical applications. We have begun a research effort aimed at learning routines for automatically mapping information from biomedical text sources, such as MEDLINE, into structured representations, such as knowledge bases. We describe our application, two learning methods that we have applied to this task, and our initial ...

متن کامل

Automatic Discovery of Preservation Alternatives Supported by Community Maintained Knowledge Bases

Preservation Planning, which deals with selecting the most appropriate preservation action to be applied to digital objects, is an important step in any digital preservation activity. Comprehensive Preservation Planning depends on the availability of identified alternatives of preservation actions, which are for example file format migrations to migrate data in an outdated format to one that ha...

متن کامل

Learning Knowledge Bases for Information Extraction from Multiple Text Based Web Sites

We describe a learning approach to automatically building knowledge bases for information extraction from multiple text based web pages. A frame based representation is introduced to represent domain knowledge as knowledge unit frames. A frame learning algorithm is developed to automatically learn knowledge unit frames from training examples. Some training examples can be obtained by automatica...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1993